Investigating Festival's target cost function using perceptual experiments

Authors

  • Volker Strom
  • Simon King
Abstract

We describe an investigation of the target cost used in the Festival unit selection speech synthesis system [1]. Our ultimate goal is to automatically learn a perceptually optimal target cost function. In this study, we investigated the behaviour of the target cost for one segment type. The target cost is based on counting the mismatches in several context features. A carrier sentence (“My name is Roger”) was synthesised using all 147,820 possible combinations of the diphones /n ei/ and /ei m/. 92 representative versions were selected and presented to listeners as 460 pairwise comparisons. The listeners’ preference votes were used to analyse the behaviour of the target cost, with respect to the values of its component linguistic context features.
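The mismatch-counting idea in the abstract can be illustrated with a minimal sketch. It is not Festival's implementation: the feature names (`stress`, `phrase_position`, `left_context`), the optional per-feature weights, and the function name `target_cost` are all assumptions made for illustration only.

```python
from typing import Dict, Hashable, Optional


def target_cost(
    target: Dict[str, Hashable],
    candidate: Dict[str, Hashable],
    weights: Optional[Dict[str, float]] = None,
) -> float:
    """Sum the weights of the context features on which the candidate
    unit differs from the target specification.

    With no weights given, every feature counts 1.0, so the result is a
    plain mismatch count. Feature names are hypothetical.
    """
    if weights is None:
        weights = {name: 1.0 for name in target}
    return sum(
        weight
        for name, weight in weights.items()
        if target.get(name) != candidate.get(name)
    )


# Hypothetical context features for one target diphone and one candidate unit.
target = {"stress": 1, "phrase_position": "medial", "left_context": "m"}
candidate = {"stress": 0, "phrase_position": "medial", "left_context": "n"}

print(target_cost(target, candidate))  # two features differ
```

Under this sketch, a candidate that matches the target on every feature has cost zero, and changing the weights changes which mismatches dominate. Listener preference votes such as those described above are the kind of evidence that could inform how such features are weighted.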


Related resources

A classifier-based target cost for unit selection speech synthesis trained on perceptual data

Our goal is to automatically learn a perceptually-optimal target cost function for a unit selection speech synthesiser. The approach we take here is to train a classifier on human perceptual judgements of synthetic speech. The output of the classifier is used to make a simple three-way distinction rather than to estimate a continuously-valued cost. In order to collect the necessary perceptual d...


Target Detection in Bistatic Passive Radars by Using Adaptive Processing Based on Correntropy Cost Function

In this paper, a novel method is introduced for target detection in bistatic passive radars, which uses the concept of correntropy to distinguish correct targets from false detections. In the proposed method, the history of each cell of the ambiguity function is modeled as a stochastic process. The stochastic processes consisting of noise are then differentiated from those containing targets by constructing...


Investigating the Relationship between Religious Attitude and Perceptual Errors in Stock Exchange Investors

Investors' perceptual error is one of the issues discussed in behavioral finance. A perceptual error is a mistaken sensory impression; that is, what we see or hear does not match reality. Because individuals' perception is affected by their worldview and beliefs, and religious attitudes can also affect their viewpoint, in this study, the relation...


Target Cost of F0 Based on Pol Concatenative Speec

This paper proposes a target cost function for F0 based on polynomial regression for use in concatenative speech synthesis. Polynomial regression is used to express the time series of F0 continuously, and remove effects of microprosody. We conducted a perceptual experiment and confirmed that the proposed function provides a higher correlation with perceptual scores than does the conventionally ...


Image authentication using LBP-based perceptual image hashing

Feature extraction is a main step in all perceptual image hashing schemes, in which robust features lead to better perceptual robustness. Simplicity, discriminative power, computational efficiency and robustness to illumination changes are counted among the distinguishing properties of Local Binary Pattern features. In this paper, we investigate the use of local binary patterns for percep...




Publication date: 2008